Arabic Handwritten Words Off-line Recognition based on HMMs and DBNs
Internal identifier: 000388 (Main/Exploration); previous: 000387; next: 000389
Authors: Akram Khémiri [Tunisia]; Afef Kacem [Tunisia]; Abdel Belaïd [France]; Mourad Elloumi [Tunisia]
Source:
Abstract
In this work, we investigate the combination of PGM (Probabilistic Graphical Model) classifiers, either independent or coupled, for the recognition of Arabic handwritten words. The independent classifiers are vertical and horizontal HMMs (Hidden Markov Models) whose observable outputs are features extracted from the image columns and the image rows respectively. The coupled classifiers combine the vertical and horizontal observation streams into a single DBN (Dynamic Bayesian Network). We propose a novel method to extract the word baseline, together with simple, easily extracted features for building the feature vectors of the words in the vocabulary. Some of these features are statistical, based on pixel distributions and local pixel configurations. Others are structural, based on the presence of ascenders, descenders, loops and diacritic points. Experiments on handwritten Arabic words from IFN/ENIT strongly support the feasibility of the proposed approach. The recognition rates reach 90.42% with the vertical and horizontal HMMs, and 85.03% and 85.21% with a first and a second DBN respectively, which outperform the results of some other PGM-based works.
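As a rough illustration of the two observation streams mentioned in the abstract, the sketch below computes one simple statistical feature, the ink-pixel density, per image column and per image row. It is not the authors' feature set: the function names, the toy binary image, the choice of density as the only feature, and the left-to-right column order are all assumptions made for this example.

```python
# Minimal sketch (assumption, not the paper's method): build a "vertical"
# observation stream from image columns and a "horizontal" one from image rows,
# using ink-pixel density as the only feature.

def column_densities(image):
    """Fraction of ink pixels (value 1) in each column, scanned left to right."""
    height = len(image)
    width = len(image[0])
    return [sum(image[r][c] for r in range(height)) / height for c in range(width)]


def row_densities(image):
    """Fraction of ink pixels (value 1) in each row, scanned top to bottom."""
    width = len(image[0])
    return [sum(row) / width for row in image]


# Hypothetical 3x4 binary word image: 1 = ink, 0 = background.
toy_word = [
    [0, 1, 0, 0],
    [1, 1, 0, 1],
    [0, 1, 0, 0],
]

vertical_stream = column_densities(toy_word)   # one observation per column
horizontal_stream = row_densities(toy_word)    # one observation per row
print(vertical_stream)
print(horizontal_stream)
```

Each stream could then be fed to its own HMM, or both to a coupled model; how the abstract's classifiers actually consume and combine them is not specified in this record.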
Url:
DOI: 10.1109/ICDAR.2015.7333724
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Hal, to step Corpus: 000F65
- to stream Hal, to step Curation: 000F65
- to stream Hal, to step Checkpoint: 000365
- to stream Main, to step Merge: 000388
- to stream Main, to step Curation: 000388
The document in XML format
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">Arabic Handwritten Words Off-line Recognition based on HMMs and DBNs</title>
<author><name sortKey="Khemiri, Akram" sort="Khemiri, Akram" uniqKey="Khemiri A" first="Akram" last="Khémiri">Akram Khémiri</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-39219" status="VALID"><orgName>Technologie de l'Information et de la Communication</orgName>
<orgName type="acronym">UTIC</orgName>
<desc><address><addrLine>5, Avenue Taha Hussein, B. P. : 56, Bab Menara, 1008 Tunis</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.esstt.rnu.tn/utic/</ref>
</desc>
<listRelation><relation active="#struct-310013" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-310013" type="direct"><org type="institution" xml:id="struct-310013" status="INCOMING"><orgName>École Supérieure des Sciences et Technologies de Tunis</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
<author><name sortKey="Kacem, Afef" sort="Kacem, Afef" uniqKey="Kacem A" first="Afef" last="Kacem">Afef Kacem</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-39219" status="VALID"><orgName>Technologie de l'Information et de la Communication</orgName>
<orgName type="acronym">UTIC</orgName>
<desc><address><addrLine>5, Avenue Taha Hussein, B. P. : 56, Bab Menara, 1008 Tunis</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.esstt.rnu.tn/utic/</ref>
</desc>
<listRelation><relation active="#struct-310013" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-310013" type="direct"><org type="institution" xml:id="struct-310013" status="INCOMING"><orgName>École Supérieure des Sciences et Technologies de Tunis</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
<author><name sortKey="Belaid, Abdel" sort="Belaid, Abdel" uniqKey="Belaid A" first="Abdel" last="Belaïd">Abdel Belaïd</name>
<affiliation wicri:level="1"><hal:affiliation type="researchteam" xml:id="struct-206042" status="VALID"><orgName>Recognition of writing and analysis of documents</orgName>
<orgName type="acronym">READ</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
<listRelation><relation active="#struct-423086" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles><tutelle active="#struct-423086" type="direct"><org type="department" xml:id="struct-423086" status="VALID"><orgName>Department of Natural Language Processing &amp; Knowledge Discovery</orgName>
<orgName type="acronym">LORIA - NLPKD</orgName>
<desc><address><country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/Knowledge-and-Language-Management</ref>
</desc>
<listRelation><relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect"><org type="laboratory" xml:id="struct-206040" status="VALID"><idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc><address><addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation><relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect"><org type="institution" xml:id="struct-300009" status="VALID"><orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc><address><addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect"><org type="institution" xml:id="struct-413289" status="VALID"><idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc><address><addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect"><org type="institution" xml:id="struct-441569" status="VALID"><idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc><address><country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName><settlement type="city">Nancy</settlement>
<settlement type="city">Metz</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université de Lorraine</orgName>
</affiliation>
</author>
<author><name sortKey="Elloumi, Mourad" sort="Elloumi, Mourad" uniqKey="Elloumi M" first="Mourad" last="Elloumi">Mourad Elloumi</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-442209" status="VALID"><orgName>Laboratoire de Recherche en Technologies de l’Information et de la Communication &amp; Génie Electrique [Tunis]</orgName>
<orgName type="acronym">LaTICE</orgName>
<desc><address><country key="TN"></country>
</address>
<ref type="url">http://www.latice.rnu.tn/</ref>
</desc>
<listRelation><relation active="#struct-51213" type="direct"></relation>
<relation active="#struct-301533" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-51213" type="direct"><org type="institution" xml:id="struct-51213" status="VALID"><orgName>Université de Tunis [Tunis]</orgName>
<desc><address><addrLine>92, Avenue 9 avril 1938, Tunis - 1007</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.utunis.rnu.tn/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-301533" type="direct"><org type="institution" xml:id="struct-301533" status="VALID"><orgName>Ecole Supérieure des Sciences et Techniques de Tunis</orgName>
<orgName type="acronym">ESSTT</orgName>
<desc><address><addrLine>Tunis</addrLine>
<country key="TN"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01254724</idno>
<idno type="halId">hal-01254724</idno>
<idno type="halUri">https://hal.inria.fr/hal-01254724</idno>
<idno type="url">https://hal.inria.fr/hal-01254724</idno>
<idno type="doi">10.1109/ICDAR.2015.7333724</idno>
<date when="2015-08-23">2015-08-23</date>
<idno type="wicri:Area/Hal/Corpus">000F65</idno>
<idno type="wicri:Area/Hal/Curation">000F65</idno>
<idno type="wicri:Area/Hal/Checkpoint">000365</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">000365</idno>
<idno type="wicri:Area/Main/Merge">000388</idno>
<idno type="wicri:Area/Main/Curation">000388</idno>
<idno type="wicri:Area/Main/Exploration">000388</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Arabic Handwritten Words Off-line Recognition based on HMMs and DBNs</title>
<author><name sortKey="Khemiri, Akram" sort="Khemiri, Akram" uniqKey="Khemiri A" first="Akram" last="Khémiri">Akram Khémiri</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-39219" status="VALID"><orgName>Technologie de l'Information et de la Communication</orgName>
<orgName type="acronym">UTIC</orgName>
<desc><address><addrLine>5, Avenue Taha Hussein, B. P. : 56, Bab Menara, 1008 Tunis</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.esstt.rnu.tn/utic/</ref>
</desc>
<listRelation><relation active="#struct-310013" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-310013" type="direct"><org type="institution" xml:id="struct-310013" status="INCOMING"><orgName>École Supérieure des Sciences et Technologies de Tunis</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
<author><name sortKey="Kacem, Afef" sort="Kacem, Afef" uniqKey="Kacem A" first="Afef" last="Kacem">Afef Kacem</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-39219" status="VALID"><orgName>Technologie de l'Information et de la Communication</orgName>
<orgName type="acronym">UTIC</orgName>
<desc><address><addrLine>5, Avenue Taha Hussein, B. P. : 56, Bab Menara, 1008 Tunis</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.esstt.rnu.tn/utic/</ref>
</desc>
<listRelation><relation active="#struct-310013" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-310013" type="direct"><org type="institution" xml:id="struct-310013" status="INCOMING"><orgName>École Supérieure des Sciences et Technologies de Tunis</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
<author><name sortKey="Belaid, Abdel" sort="Belaid, Abdel" uniqKey="Belaid A" first="Abdel" last="Belaïd">Abdel Belaïd</name>
<affiliation wicri:level="1"><hal:affiliation type="researchteam" xml:id="struct-206042" status="VALID"><orgName>Recognition of writing and analysis of documents</orgName>
<orgName type="acronym">READ</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
<listRelation><relation active="#struct-423086" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles><tutelle active="#struct-423086" type="direct"><org type="department" xml:id="struct-423086" status="VALID"><orgName>Department of Natural Language Processing &amp; Knowledge Discovery</orgName>
<orgName type="acronym">LORIA - NLPKD</orgName>
<desc><address><country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/Knowledge-and-Language-Management</ref>
</desc>
<listRelation><relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect"><org type="laboratory" xml:id="struct-206040" status="VALID"><idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc><address><addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation><relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect"><org type="institution" xml:id="struct-300009" status="VALID"><orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc><address><addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect"><org type="institution" xml:id="struct-413289" status="VALID"><idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc><address><addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect"><org type="institution" xml:id="struct-441569" status="VALID"><idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc><address><country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName><settlement type="city">Nancy</settlement>
<settlement type="city">Metz</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université de Lorraine</orgName>
</affiliation>
</author>
<author><name sortKey="Elloumi, Mourad" sort="Elloumi, Mourad" uniqKey="Elloumi M" first="Mourad" last="Elloumi">Mourad Elloumi</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-442209" status="VALID"><orgName>Laboratoire de Recherche en Technologies de l’Information et de la Communication &amp; Génie Electrique [Tunis]</orgName>
<orgName type="acronym">LaTICE</orgName>
<desc><address><country key="TN"></country>
</address>
<ref type="url">http://www.latice.rnu.tn/</ref>
</desc>
<listRelation><relation active="#struct-51213" type="direct"></relation>
<relation active="#struct-301533" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-51213" type="direct"><org type="institution" xml:id="struct-51213" status="VALID"><orgName>Université de Tunis [Tunis]</orgName>
<desc><address><addrLine>92, Avenue 9 avril 1938, Tunis - 1007</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.utunis.rnu.tn/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-301533" type="direct"><org type="institution" xml:id="struct-301533" status="VALID"><orgName>Ecole Supérieure des Sciences et Techniques de Tunis</orgName>
<orgName type="acronym">ESSTT</orgName>
<desc><address><addrLine>Tunis</addrLine>
<country key="TN"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
</analytic>
<idno type="DOI">10.1109/ICDAR.2015.7333724</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">In this work, we investigate the combination of PGM (Probabilistic Graphical Model) classifiers, either independent or coupled, for the recognition of Arabic handwritten words. The independent classifiers are vertical and horizontal HMMs (Hidden Markov Models) whose observable outputs are features extracted from the image columns and the image rows respectively. The coupled classifiers combine the vertical and horizontal observation streams into a single DBN (Dynamic Bayesian Network). We propose a novel method to extract the word baseline, together with simple, easily extracted features for building the feature vectors of the words in the vocabulary. Some of these features are statistical, based on pixel distributions and local pixel configurations. Others are structural, based on the presence of ascenders, descenders, loops and diacritic points. Experiments on handwritten Arabic words from IFN/ENIT strongly support the feasibility of the proposed approach. The recognition rates reach 90.42% with the vertical and horizontal HMMs, and 85.03% and 85.21% with a first and a second DBN respectively, which outperform the results of some other PGM-based works.</div>
</front>
</TEI>
<affiliations><list><country><li>France</li>
<li>Tunisie</li>
</country>
<region><li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement><li>Metz</li>
<li>Nancy</li>
</settlement>
<orgName><li>Université de Lorraine</li>
</orgName>
</list>
<tree><country name="Tunisie"><noRegion><name sortKey="Khemiri, Akram" sort="Khemiri, Akram" uniqKey="Khemiri A" first="Akram" last="Khémiri">Akram Khémiri</name>
</noRegion>
<name sortKey="Elloumi, Mourad" sort="Elloumi, Mourad" uniqKey="Elloumi M" first="Mourad" last="Elloumi">Mourad Elloumi</name>
<name sortKey="Kacem, Afef" sort="Kacem, Afef" uniqKey="Kacem A" first="Afef" last="Kacem">Afef Kacem</name>
</country>
<country name="France"><region name="Grand Est"><name sortKey="Belaid, Abdel" sort="Belaid, Abdel" uniqKey="Belaid A" first="Abdel" last="Belaïd">Abdel Belaïd</name>
</region>
</country>
</tree>
</affiliations>
</record>
To manipulate this document under Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000388 | SxmlIndent | more
Or
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000388 | SxmlIndent | more
To link to this page within the Wicri network
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Main |étape= Exploration |type= RBID |clé= Hal:hal-01254724 |texte= Arabic Handwritten Words Off-line Recognition based on HMMs and DBNs }}
This area was generated with Dilib version V0.6.33.